Selective memory cell program and erase

ABSTRACT

Techniques are disclosed herein for programming memory arrays to achieve high program/erase cycle endurance. In some aspects, only selected word lines (WL) are programmed with other WLs remaining unprogrammed. As an example, only the even word lines are programmed with the odd WLs left unprogrammed. After all of the even word lines are programmed and the data block is to be programmed with new data, the block is erased. Later, only the odd word lines are programmed. The data may be transferred to a block that stores multiple bit per memory cell prior to the erase. In one aspect, the data is programmed in a checkerboard pattern with some memory cells programmed and others left unprogrammed. Later, after erasing the data, the previously unprogrammed part of the checkerboard pattern is programmed with remaining cells unprogrammed.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to non-volatile memory.

2. Description of the Related Art

Semiconductor memory has become increasingly popular for use in variouselectronic devices. For example, non-volatile semiconductor memory isused in cellular telephones, digital cameras, personal digitalassistants, mobile computing devices, non-mobile computing devices andother devices. Electrically Erasable Programmable Read Only Memory(EEPROM) and flash memory are among the most popular non-volatilesemiconductor memories. With flash memory, also a type of EEPROM, thecontents of the whole memory array, or of a portion of the memory, canbe erased in one step, in contrast to the traditional, full-featuredEEPROM.

Both the traditional EEPROM and the flash memory utilize a floating gatethat is positioned above and insulated from a channel region in asemiconductor substrate. The floating gate is positioned between thesource and drain regions. A control gate is provided over and insulatedfrom the floating gate. The threshold voltage (V_(TH)) of the transistorthus formed is controlled by the amount of charge that is retained onthe floating gate. That is, the minimum amount of voltage that must beapplied to the control gate before the transistor is turned on to permitconduction between its source and drain is controlled by the level ofcharge on the floating gate.

Some EEPROM and flash memory devices have a floating gate that is usedto store two ranges of charges and, therefore, the memory element can beprogrammed/erased between two states, e.g., an erased state and aprogrammed state. Typically, memory cells having a threshold voltagewithin a first voltage range are considered to be in the erased stateand those having a threshold voltage within a second voltage range areconsidered to be in the programmed state. Typically, there is a windowbetween the first and second range. Such a flash memory device issometimes referred to as a binary flash memory device because eachmemory element can store one bit of data.

A multi-state (also called multi-level) flash memory device isimplemented by identifying multiple distinct allowed/valid programmedthreshold voltage ranges. Each distinct threshold voltage rangecorresponds to a predetermined value for the set of data bits encoded inthe memory device. For example, each memory element can store two bitsof data when the element can be placed in one of four discrete chargebands corresponding to four distinct threshold voltage ranges.

Some flash memory devices operate as both binary and multi-states. Forexample, some memory cells are used to store one bit of data(“single-level cell or SLC blocks”) and other memory cells are used tostore multiple bits per cell (“multi-level cell or MLC blocks”). Forsome devices, the SLC blocks and MLC blocks are part of the sameintegrated circuit, and may even be part of the same memory array. TheSLC blocks may be used for short term storage of data, whereas the MLCblocks may be used for long term data storage. In other words, the SLCblocks might be used somewhat like a cache. Thus, the SLC blocks may beprogrammed/erased many more times over the life of the device than MLCblocks. Therefore, write/erase endurance may be a more significantproblem for SLC blocks than for MLC blocks.

For some memory arrays, the array is arranged as a number of parallelword lines and a number of bit lines that run perpendicular to the wordlines. Each memory cell may be associated with one word line and one bitline. In certain situations, a memory cell can be affected by the chargestored on the floating gate of an adjacent memory cell on a neighboringword line and/or neighboring bit line.

Shifts in the apparent charge stored on a floating gate of a memory cellcan occur because of the coupling of an electric field due to the chargestored in adjacent floating gates. This phenomenon is described in U.S.Pat. No. 5,867,429, which is incorporated herein by reference in itsentirety. The problem occurs most pronouncedly between sets of adjacentmemory cells that have been programmed at different times. For example,a first memory cell is programmed to add a level of charge to itsfloating gate that corresponds to one set of data. Subsequently, one ormore adjacent memory cells are programmed to add a level of charge totheir floating gates that correspond to a second set of data. After theone or more of the adjacent memory cells are programmed, the chargelevel read from the first memory cell appears to be different thanprogrammed because of the effect of the charge on the adjacent memorycells being coupled to the first memory cell. The coupling from adjacentmemory cells can shift the apparent charge level being read a sufficientamount to lead to an erroneous reading of the data stored. Herein, thisadjacent floating gate to floating gate effect may be referred to as onetype of “adjacent floating gate charge coupling effect.”

The charge on an adjacent floating gate can also interfere with theconductive channel in the substrate below the floating gate of a memorycell. Specifically, the charge on the adjacent floating gate may impacthow strongly the channel of another memory cell conducts a current.Thus, if the charge stored in an adjacent floating gate changes, then itmay require a greater (or smaller) voltage on the control gate the othermemory cell to create the same current in the channel. The net impact isthat the amount of charge stored on the memory cell appears to bedifferent due to the change in the charge stored in the adjacentfloating gate. This problem is most pronounced between sets of adjacentmemory cells that have been programmed at different times. Herein, thisadjacent floating gate to channel effect may be referred to as anothertype of “adjacent floating gate charge coupling effect.”

Another problem with memory cells is that over time charge canaccumulate in a dielectric near the floating gate. For example, whenprogramming a memory cell, charge can become trapped in a tunnel oxidelayer below the floating gate of the memory cell. Erasing the memorycell may not completely remove the trapped charge. With eachprogram/erase cycle, the amount of trapped charge increases.

As memory cells continue to shrink in size, the associated reduction inspace between memory cells may increase the adjacent floating gatecharge coupling effects. As the number of program/erase cyclesincreases, the charge trapping around adjacent floating gatesexacerbates the floating gate charge coupling effects. For memory cellswhich undergo many program/erase cycles, the large adjacent floatinggate charge coupling effects severely shrinks the difference between thethreshold voltage ranges. For example, the gap between the range ofthreshold voltages that represents a “1” and “0” decreases. To guaranteereliability and avoid read errors, there should be a certain amount ofthreshold voltage separation between the “1” state and the “0” state.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a top view of a NAND string.

FIG. 2 is a circuit diagram of three NAND strings with associated wordlines.

FIG. 3 is a block diagram of an array of NAND flash storage elements.

FIG. 4 is a block diagram depicting one embodiment of a memory array.

FIG. 5 is a block diagram depicting one embodiment of a sense block.

FIG. 6A-6C depict example threshold voltage distributions.

FIG. 7A depicts one embodiment of a process of programming a block ofmemory cells in a memory array.

FIG. 7B depicts another embodiment of a process of programming a blockof memory cells in a memory array.

FIGS. 8A and 8B depict example patterns that result after programmingmemory cells on odd and even word lines.

FIG. 9A depicts one embodiment of a process programming SLC and MLCblocks.

FIG. 9B depicts another embodiment of a process programming SLC and MLCblocks.

FIG. 10 depicts one embodiment of a flowchart of a process ofprogramming a block of memory cells in a memory array in a checkerboardpattern.

FIGS. 11A and 11B depict example checkerboard pattern that result afterprogramming memory cells on odd and even word lines.

FIG. 12 depicts one embodiment of a process programming SLC and MLCblocks using a checkerboard pattern.

FIG. 13 is a flow chart describing details of programming memory cells.

FIG. 14 is a flow chart describing details of erasing memory cells.

DETAILED DESCRIPTION

Techniques are disclosed herein for programming memory arrays in a waythat achieves high program/erase cycle endurance. Techniques reducefloating gate charge coupling effects between word lines, which canincrease the endurance of memory cells. Techniques reduce floating gatecharge coupling effects between bit lines, which can increase theendurance of memory cells. Techniques provide for a wide window betweenthreshold voltage distribution states. In some aspects, the techniquesare applied to SLC blocks in a memory array that also includes MLCblocks.

In one aspect, only certain selected word lines (WL) in the block areprogrammed. Other WLs are left erased (unprogrammed). This reduces oreliminates WL-WL floating gate charge coupling effects. Initially, theentire group of memory cells (e.g., block) may be erased. When a WL isprogrammed both neighboring WLs (WLn−1 and WLn+1) are left in the erasedstate without being programmed. In this example, “n” might be evenintegers or, alternatively, odd integers. As an example, only the evenword lines are programmed. After all of the even word lines areprogrammed and the data block is to be programmed with new data, theblock is erased. Note that when erasing the block, memory cells on theodd WLs do not need to be erased which reduces stress on memory cells onthe odd WLs. Later, only odd WLs are programmed.

One aspect is operating memory arrays having SLC blocks and MLC blocks.In one aspect, the even/odd programming discussed in the previousparagraph is applied to SLC blocks. After, for example, memory cells oneven WLs in several SLC blocks are programmed with data, the data istransferred to one or more MLC blocks. Then, the SLC blocks are erasedand the odd word lines in the SLC blocks may be programmed.

In one aspect, programming a block or other unit is performed in acheckerboard pattern. For example, on the even WLs, only the even memorycells are programmed and on the odd WLs only the odd memory cells areprogrammed. Later, when new data is to be stored in the block, thecheckerboard pattern is reversed such that on even WLs only the oddmemory cells are programmed and on the odd WLs only the even memorycells are programmed. This programming scheme may reduce or eliminateboth WL-WL floating gate charge coupling effects, as well as bit line tobit line floating gate charge coupling effects.

In one aspect, the checkerboard pattern programming discussed in theprevious paragraph is applied to SLC blocks. After, for example, severalSLC blocks are programmed using the checkerboard pattern, the data istransferred to one or more MLC blocks. Then, the SLC blocks are erasedand the inverse of the checkerboard pattern may be used to program theSLC blocks.

The techniques described herein are applicable to a wide range of memoryarrays. The following is one example NAND architecture. However,techniques described herein are not limited to this example. One exampleof a flash memory system uses the NAND structure, which includesarranging multiple floating gate transistors in series between twoselect gates. The transistors in series and the select gates arereferred to as a NAND string. FIG. 1 is a top view showing one NANDstring. The NAND string depicted in FIG. 1 includes four transistors100, 102, 104 and 106 in series and sandwiched between a first (or drainside) select gate 120 and a second (or source side) select gate 122.Select gate 120 connects the NAND string to a bit line via bit linecontact 126. Select gate 122 connects the NAND string to source line128. Select gate 120 is controlled by applying the appropriate voltagesto select line SGD. Select gate 122 is controlled by applying theappropriate voltages to select line SGS. Each of the transistors 100,102, 104 and 106 has a control gate and a floating gate. For example,transistor 100 has control gate 100CG and floating gate 100FG.Transistor 102 includes control gate 102CG and a floating gate 102FG.Transistor 104 includes control gate 104CG and floating gate 104FG.Transistor 106 includes a control gate 106CG and a floating gate 106FG.Control gate 100CG is connected to word line WL3, control gate 102CG isconnected to word line WL2, control gate 104CG is connected to word lineWL1, and control gate 106CG is connected to word line WL0.

A typical architecture for a flash memory system using a NAND structurewill include many NAND strings. Each NAND string is connected to thesource line by its source select gate controlled by select line SGS andconnected to its associated bit line by its drain select gate controlledby select line SGD. Each bit line and the respective NAND string(s) thatare connected to that bit line via a bit line contact comprise thecolumns of the array of memory cells. Bit lines are shared with multipleNAND strings. Typically, the bit line runs on top of the NAND strings ina direction perpendicular to the word lines and is connected to one ormore sense amplifiers.

FIG. 2 shows three NAND strings 302, 304 and 306 of a memory arrayhaving many more NAND strings. Each of the NAND strings of FIG. 2includes two select transistors and four memory cells. For example, NANDstring 302 includes select transistors 320 and 330, and memory cells322, 324, 326 and 328. NAND string 304 includes select transistors 340and 350, and memory cells 342, 344, 346 and 348. Each NAND string isconnected to the source line by its select transistor (e.g. selecttransistor 330 and select transistor 350). A selection line SGS is usedto control the source side select gates. The various NAND strings areconnected to respective bit lines by select transistors 320, 340, etc.,which are controlled by select line SGD. In other embodiments, theselect lines do not necessarily need to be in common. Word line WL3 isconnected to the control gates for memory cell 322 and memory cell 342.Word line WL2 is connected to the control gates for memory cell 324,memory cell 344, and memory cell 352. Word line WL1 is connected to thecontrol gates for memory cell 326 and memory cell 346. Word line WL0 isconnected to the control gates for memory cell 328 and memory cell 348.As can be seen, each bit line and the respective NAND string comprisethe columns of the array of memory cells. The word lines (WL3, WL2, WL1and WL0) comprise the rows of the array.

Note that a NAND string can have fewer or more memory cells thandepicted in FIG. 2. For example, some NAND strings will include eightmemory cells, 16 memory cells, 32 memory cells, 64 memory cells, 128memory cells, etc. The discussion herein is not limited to anyparticular number of memory cells in a NAND string. Furthermore, a wordline can have more or fewer memory cells than depicted in FIG. 2. Forexample, a word line can include thousand or tens of thousands of memorycells. The discussion herein is not limited to any particular number ofmemory cells in a word line.

Each memory cell can store data (analog or digital). When storing onebit of digital data, the range of possible threshold voltages of thememory cell is divided into two ranges which are assigned logical data“1” and “0.” In one example of a NAND type flash memory, the thresholdvoltage is negative after the memory cell is erased, and defined aslogic “1.” The threshold voltage after programming is positive anddefined as logic “0.” When the threshold voltage is negative and a readis attempted by applying 0 volts to the control gate, the memory cellwill turn on to indicate logic one is being stored. When the thresholdvoltage is positive and a read operation is attempted by applying 0volts to the control gate, the memory cell will not turn on, whichindicates that logic zero is stored.

In the case of storing multiple levels of data, the range of possiblethreshold voltages is divided into the number of levels of data. Forexample, if four levels of information is stored (two bits of data),there will be four threshold voltage ranges assigned to the data values“11”, “10”, “01”, and “00.” In one example of a NAND type memory, thethreshold voltage after an erase operation is negative and defined as“11”. Positive threshold voltages are used for the data states of “10”,“01”, and “00.” If eight levels of information (or states) are stored(e.g. for three bits of data), there will be eight threshold voltageranges assigned to the data values “000”, “001”, “010”, “011” “100”,“101”, “110” and “111.”

The specific relationship between the data programmed into the memorycell and the threshold voltage levels of the cell depends upon the dataencoding scheme adopted for the cells. For example, U.S. Pat. No.6,222,762 and U.S. Patent Application Publication No. 2004/0255090, bothof which are incorporated herein by reference in their entirety,describe various data encoding schemes for multi-state flash memorycells. In one embodiment, data values are assigned to the thresholdvoltage ranges using a Gray code assignment so that if the thresholdvoltage of a floating gate erroneously shifts to its neighboringphysical state, only one bit will be affected. In some embodiments, thedata encoding scheme can be changed for different word lines, the dataencoding scheme can be changed over time, or the data bits for randomword lines may be inverted or otherwise randomized to reduce datapattern sensitivity and even wear on the memory cells.

Relevant examples of NAND type flash memories and their operation areprovided in the following U.S. patents/patent applications, all of whichare incorporated herein by reference: U.S. Pat. No. 5,570,315; U.S. Pat.No. 5,774,397; U.S. Pat. No. 6,046,935; U.S. Pat. No. 6,456,528; andU.S. Pat. Publication No. US2003/0002348. The discussion herein can alsoapply to other types of flash memory in addition to NAND as well asother types of non-volatile memory. For example, the following patentsdescribe NOR type flash memories and are incorporated herein byreference in their entirety: U.S. Pat. Nos. 5,095,344; 5,172,338;5,890,192 and 6,151,248.

Other types of non-volatile storage devices, in addition to NAND flashmemory, can also be used. For example, a so called TANOS structure(consisting of a stacked layer of TaN—Al₂O₃—SiN—SiO₂ on a siliconsubstrate), which is basically a memory cell using trapping of charge ina nitride layer (instead of a floating gate), can also be used with thepresent invention. Another type of memory cell useful in flash EEPROMsystems utilizes a non-conductive dielectric material in place of aconductive floating gate to store charge in a non-volatile manner. Sucha cell is described in an article by Chan et al., “A TrueSingle-Transistor Oxide-Nitride-Oxide EEPROM Device,” IEEE ElectronDevice Letters, Vol. EDL-8, No. 3, March 1987, pp. 93-95. A triple layerdielectric formed of silicon oxide, silicon nitride and silicon oxide(“ONO”) is sandwiched between a conductive control gate and a surface ofa semi-conductive substrate above the memory cell channel. The cell isprogrammed by injecting electrons from the cell channel into thenitride, where they are trapped and stored in a limited region. Thisstored charge then changes the threshold voltage of a portion of thechannel of the cell in a manner that is detectable. The memory cell iserased by injecting hot holes into the nitride. See also Nozaki et al.,“A 1-Mb EEPROM with MONOS Memory Cell for Semiconductor DiskApplication,” IEEE Journal of Solid-State Circuits, Vol. 26, No. 4,April 1991, pp. 497-501, which describes a similar memory cell in asplit-gate configuration where a doped polysilicon gate extends over aportion of the memory cell channel to form a separate select transistor.The foregoing two articles are incorporated herein by reference in theirentirety. The programming techniques mentioned in section 1.2 of“Nonvolatile Semiconductor Memory Technology,” edited by William D.Brown and Joe E. Brewer, IEEE Press, 1998, incorporated herein byreference, are also described in that section to be applicable todielectric charge-trapping devices. Other types of memory devices canalso be used.

FIG. 3 illustrates a non-volatile storage device 210 that may includeone or more memory die or chips 212. Memory die 212 includes an array(two-dimensional or three dimensional) of memory cells 200, controlcircuitry 220, and read/write circuits 230A and 230B. In one embodiment,access to the memory array 200 by the various peripheral circuits isimplemented in a symmetric fashion, on opposite sides of the array, sothat the densities of access lines and circuitry on each side arereduced by half. The read/write circuits 230A and 230B include multiplesense blocks 300 which allow a page of memory cells to be read orprogrammed in parallel. The memory array 200 is addressable by wordlines via row decoders 240A and 240B and by bit lines via columndecoders 242A and 242B. In a typical embodiment, a controller 244 isincluded in the same memory device 210 (e.g., a removable storage cardor package) as the one or more memory die 212. Commands and data aretransferred between the host and controller 244 via lines 232 andbetween the controller and the one or more memory die 212 via lines 234.One implementation can include multiple chips 212.

Control circuitry 220 cooperates with the read/write circuits 230A and230B to perform memory operations on the memory array 200. The controlcircuitry 220 includes a state machine 222, an on-chip address decoder224 and a power control module 226. The state machine 222 provideschip-level control of memory operations. The on-chip address decoder 224provides an address interface to convert between the address that isused by the host or a memory controller to the hardware address used bythe decoders 240A, 240B, 242A, and 242B. The power control module 226controls the power and voltages supplied to the word lines and bit linesduring memory operations. In one embodiment, power control module 226includes one or more charge pumps that can create voltages larger thanthe supply voltage.

The memory array 200 includes an MLC block region 200 a and an SLC blockregion 200 b. An SLC block and an MLC block may have the same number ofmemory cells for user data; however, because an MLC block storesmultiple bits per cell an MLC block may store 2, 3, 4, etc. times asmuch data as an SLC block. It is not required that SLC blocks and MLCblocks have the same number of memory cells. Typically, the data storedin the MLC blocks is processed with a stronger ECC algorithm than thatused in SLC blocks in order to provide greater reliability. Such strongECC is generally not required with SLC blocks. A region 200 b of thememory array 200 having SLC blocks will be referred to as “an SLC region200 b” and a region 200 a of the memory array 200 having MLC blocks willbe referred to as “an MLC region 200 a.” Note that in some embodiments,the SLC block area 200 b and MLC block area 200 a do not have to bepredefined areas. In some embodiments, all blocks in memory array 200can be used as either SLC or MLC blocks. For example, a block can beused as an SLC block at one time and as an MLC block at another time. Inother embodiments, the controller 244 defines certain blocks as SLC andMLC blocks respectively.

In some embodiments, when the controller 244 receives user data it isfirst stored in one or more SLC blocks. However, the controller 244 doesnot necessarily program all of the memory cells in the SLC block. In oneaspect, the controller 244 programs only selected word lines (e.g., onlyodd or only even WLs). In one aspect, the controller 244 programs memorycells in a checkerboard pattern. At some point, the controller 244 maytransfer the user data stored in the SLC blocks into one or more MLCblocks. As an example, if the MLC blocks each store two times as muchdata as an SLC block is capable of storing, the controller 244 may waituntil four SLC blocks are programmed and then read in that data, performECC encoding, and store the data into a single MLC block. Note that itis not required that all of the memory cells in the SLC block areprogrammed prior to the transfer to the MLC block. Also note that thistechnique may result in data being stored in MLC blocks for longerperiods of time than in SLC blocks, but that is not required.

In one embodiment, one or any combination of control circuitry 220,power control circuit 226, decoder circuit 224, state machine circuit222, decoder circuit 242A, decoder circuit 242B, decoder circuit 240A,decoder circuit 240B, read/write circuits 230A, read/write circuits230B, and/or controller 244 can be referred to as one or more managingcircuits.

FIG. 4 depicts an exemplary structure of memory cell array 200. In oneembodiment, the array of memory cells is divided into M blocks of memorycells. As is common for flash EEPROM systems, the block is the unit oferase. That is, each block contains the minimum number of memory cellsthat are erased together. Each block is typically divided into a numberof pages. A page is a unit of programming. One or more pages of data aretypically stored in one row of memory cells. A page can store one ormore sectors. A sector includes user data and overhead data. Overheaddata typically includes an Error Correction Code (ECC) that has beencalculated from the user data of the sector. A portion of the controller(described below) calculates the ECC when data is being programmed intothe array, and also checks it when data is being read from the array.Alternatively, the ECCs and/or other overhead data are stored indifferent pages, or even different blocks, than the user data to whichthey pertain. A sector of user data is typically 512 bytes,corresponding to the size of a sector in magnetic disk drives. A largenumber of pages form a block, anywhere from 8 pages, for example, up to32, 64, 128 or more pages. Different sized blocks and arrangements canalso be used.

In another embodiment, the bit lines are divided into odd bit lines andeven bit lines. In an odd/even bit line architecture, memory cells alonga common word line and connected to the odd bit lines are programmed atone time, while memory cells along a common word line and connected toeven bit lines are programmed at another time.

FIG. 4 also shows more details of block i of memory array 200. Block iincludes X+1 bit lines and X+1 NAND strings. Block i also includes 64data word lines (WL0-WL63), 2 dummy word lines (WL_d0 and WL_d1), adrain side select line (SGD) and a source side select line (SGS). Oneterminal of each NAND string is connected to a corresponding bit linevia a drain select gate (connected to select line SGD), and anotherterminal is connected to the source line via a source select gate(connected to select line SGS). Because there are sixty four data wordlines and two dummy word lines, each NAND string includes sixty fourdata memory cells and two dummy memory cells. In other embodiments, theNAND strings can have more or fewer than 64 data memory cells and moreor fewer dummy memory cells. Data memory cells can store user or systemdata. Dummy memory cells are typically not used to store user or systemdata. Some embodiments do not include dummy memory cells.

FIG. 5 is a block diagram of an individual sense block 300 partitionedinto a core portion, referred to as a sense module 480, and a commonportion 490. In one embodiment, there will be a separate sense module480 for each bit line and one common portion 490 for a set of multiplesense modules 480. In one example, a sense block will include one commonportion 490 and eight sense modules 480. Each of the sense modules in agroup will communicate with the associated common portion via a data bus472. For further details, refer to U.S. Patent Application Publication2006/0140007, filed Dec. 29, 2004, and titled, “Non-volatile memory andmethod with shared processing for an aggregate of read/write circuits,”which is herby incorporated herein by reference in its entirety.

Sense module 480 comprises sense circuitry 470 that determines whether aconduction current in a connected bit line is above or below apredetermined threshold level. In some embodiments, sense module 480includes a circuit commonly referred to as a sense amplifier. Sensemodule 480 also includes a bit line latch 482 that is used to set avoltage condition on the connected bit line. For example, apredetermined state latched in bit line latch 482 will result in theconnected bit line being pulled to a state designating program inhibit(e.g., Vdd).

Common portion 490 comprises a processor 492, a set of data latches 494and an I/O Interface 496 coupled between the set of data latches 494 anddata bus 420. Processor 492 performs computations. For example, one ofits functions is to determine the data stored in the sensed memory celland store the determined data in the set of data latches. The set ofdata latches 494 is used to store data bits determined by processor 492during a read operation. It is also used to store data bits importedfrom the data bus 420 during a program operation. The imported data bitsrepresent write data meant to be programmed into the memory. I/Ointerface 496 provides an interface between data latches 494 and thedata bus 420.

During read or sensing, the operation of the system is under the controlof state machine 222 that controls the supply of different control gatevoltages to the addressed cell. As it steps through the variouspredefined control gate voltages corresponding to the various memorystates supported by the memory, the sense module 480 may trip at one ofthese voltages and an output will be provided from sense module 480 toprocessor 492 via bus 472. At that point, processor 492 determines theresultant memory state by consideration of the tripping event(s) of thesense module and the information about the applied control gate voltagefrom the state machine via input lines 493. It then computes a binaryencoding for the memory state and stores the resultant data bits intodata latches 494. In another embodiment of the core portion, bit linelatch 482 serves double duty, both as a latch for latching the output ofthe sense module 480 and also as a bit line latch as described above.

It is anticipated that some implementations will include multipleprocessors 492. In one embodiment, each processor 492 will include anoutput line (not depicted in FIG. 5) such that each of the output linesis wired-OR'd together. In some embodiments, the output lines areinverted prior to being connected to the wired-OR line. Thisconfiguration enables a quick determination during the programverification process of when the programming process has completedbecause the state machine receiving the wired-OR line can determine whenall bits being programmed have reached the desired level. For example,when each bit has reached its desired level, a logic zero for that bitwill be sent to the wired-OR line (or a data one is inverted). When allbits output a data 0 (or a data one inverted), then the state machineknows to terminate the programming process. In embodiments where eachprocessor communicates with eight sense modules, the state machine may(in some embodiments) need to read the wired-OR line eight times, orlogic is added to processor 492 to accumulate the results of theassociated bit lines such that the state machine need only read thewired-OR line one time.

During program or verify, the data to be programmed is stored in the setof data latches 494 from the data bus 420. The program operation, underthe control of the state machine, comprises a series of programmingvoltage pulses (with increasing magnitudes) applied to the control gatesof the addressed memory cells. Each programming pulse is followed by averify process to determine if the memory cell has been programmed tothe desired state. Processor 492 monitors the verified memory staterelative to the desired memory state. When the two are in agreement,processor 492 sets the bit line latch 482 so as to cause the bit line tobe pulled to a state designating program inhibit. This inhibits the cellcoupled to the bit line from further programming even if it is subjectedto programming pulses on its control gate. In other embodiments theprocessor initially loads the bit line latch 482 and the sense circuitrysets it to an inhibit value during the verify process.

Data latch stack 494 contains a stack of data latches corresponding tothe sense module. In one embodiment, there are 3-5 (or another number)data latches per sense module 480. In one embodiment, the latches areeach one bit. In some implementations (but not required), the datalatches are implemented as a shift register so that the parallel datastored therein is converted to serial data for data bus 420, and viceversa. In one preferred embodiment, all the data latches correspondingto the read/write block of m memory cells can be linked together to forma block shift register so that a block of data can be input or output byserial transfer. In particular, the bank of read/write modules isadapted so that each of its set of data latches will shift data in to orout of the data bus in sequence as if they are part of a shift registerfor the entire read/write block.

Additional information about the read operations and sense amplifierscan be found in (1) U.S. Pat. No. 7,196,931, “Non-Volatile Memory AndMethod With Reduced Source Line Bias Errors,”; (2) U.S. Pat. No.7,023,736, “Non-Volatile Memory And Method with Improved Sensing,”; (3)U.S. Patent Application Pub. No. 2005/0169082; (4) U.S. Pat. No.7,196,928, “Compensating for Coupling During Read Operations ofNon-Volatile Memory,” and (5) United States Patent Application Pub. No.2006/0158947, “Reference Sense Amplifier For Non-Volatile Memory,”published on Jul. 20, 2006. All five of the immediately above-listedpatent documents are incorporated herein by reference in their entirety.

At the end of a successful programming process (with verification), thethreshold voltages of the memory cells should be within one or moredistributions of threshold voltages for programmed memory cells orwithin a distribution of threshold voltages for erased memory cells, asappropriate. FIG. 6A depicts example Vt distributions for states ofmemory cells in an SLC block. FIG. 6B illustrates example Vtdistributions corresponding to data states for the memory cell arraywhen each memory cell stores four bits of data. Such a distribution maybe used for programming an MLC block. Other embodiments, however, mayuse more or fewer than four bits of data per memory cell. FIG. 6B shows16 Vt distributions corresponding to data states 0-15. In someembodiments, the threshold voltages in state 0 are negative and thethreshold voltages in the states 1-15 are positive. However, thethreshold voltages in one or more of states 1-15 may be negative.

Between each of the data states 0-15 are read reference voltages usedfor reading data from memory cells. For example, FIG. 6B shows readreference voltage Vr1 between data states 0 and 1, and Vr2 between datastates 1 and 2. By testing whether the threshold voltage of a givenmemory cell is above or below the respective read reference voltages,the system can determine what state the memory cell is in.

At or near the lower edge of each data state 0-15 are verify referencevoltages. For example, FIG. 6B shows Vv1 for state 1 and Vv2 for state2. When programming memory cells to a given state, the system will testwhether those memory cells have a threshold voltage greater than orequal to the verify reference voltage.

FIG. 6C illustrates that another embodiment of Vt distributionscorresponding to data states 0-15 can partially overlap because an errorcorrection algorithm can handle a certain percentage of cells that arein error. Also note that the Vt axis may be offset from actual voltagesapplied to the control gates as body effect through source or bodybiasing is used to shift negative threshold voltage into the measurablepositive range. Another point to note is that contrary to the equalspacing/width of the depicted sixteen states, various states may havedifferent widths/spacings in order to accommodate varying amounts ofsusceptibility to data retention loss. In some embodiments, states 0and/or 15 are wider than the other states.

Referring again to FIG. 6A, State 0 may be an erase distribution thatresults from erasing all memory cells in an SLC block. The eraseverification voltage is not explicitly depicted but may be just at theright edge of the erase distribution. When an SLC block is programmed,the system moves the threshold voltage of selected memory cells todistribution 1. The system verifies whether memory cells are programmedto a threshold voltage of Vv for state 1. After programming has beencompleted, the system reads memory cells by comparing their thresholdvoltage with Vr.

FIG. 7A depicts one embodiment of a flowchart of a process 700 ofprogramming a block of memory cells in a memory array 200. In oneembodiment, process 700 is applied to SLC blocks but not to MLC blocks.However, process 700 may also be applied to MLC blocks.

In step 702, a block of memory cells in the memory array is erased. Inone aspect, the block is an SLC block, but process 700 in not limited toSLC blocks. Thus, in one aspect, the block is an MLC block. Details oferasing a block of memory cells are discussed below in connection withthe discussion of FIG. 14. In step 702, all memory cells in the blockare erased.

In step 704, memory cells associated with even word lines areprogrammed. However, memory cells on the odd word lines remain erased(unprogrammed). FIG. 8A depicts an example pattern that results afterprogramming memory cells on even word lines, but keeping memory cells onodd word lines erased. The memory cells that are encircled by dashedlines are those that are programmed with data. In FIG. 8A, on WL1, WL3and WL5 the memory cells all have a “0” indicating that they haveremained erased. Memory cells on WL0, WL2 and WL4 have either a “1” or a“0” indicating that those memory cells have been programmed with data.Herein, the phrase “programming a group of memory cells”, “programmingdata in a group of memory cells”, “a programmed word line,” or similarphrases will be understood to mean that the threshold voltage of thememory cells are set to the appropriate level to represent data. It willbe understood that there may be some memory cells for which thethreshold voltage does not need to change in order to program data intothat memory cell. For example, some of the memory cells on WL0, WL2, WL4have had their threshold voltage changed from the erase thresholdvoltage (state 0 in FIG. 6A) to a programmed threshold voltage (e.g.,state 1 in FIG. 6A). However, some memory cells on WL0, WL2, WL4 remainin the erase threshold voltage (e.g., state 0 in FIG. 6A) to represent abinary “0”. Note that the erased state could also represent binary “1.”Also note that programming could involve changing the threshold voltageto another state such as any of states 2-15 in FIGS. 6B and 6C. In otherwords, the block might be an MLC block. Typically, there will be manymore word lines in a block. For example, there might be 64 or more wordlines. Also, there are typically many more bit lines in a block. Forexample, there might be thousands of bit lines.

In step 706, memory cells on at least the even word lines are erased.Because the memory cells on the odd word lines were not programmed sincethe complete block erase in step 702, those memory cells do not need tobe erased. Details of erasing memory cells on selected word lines arediscussed with respect to FIG. 14. In some embodiments, memory cells onthe even word lines are erased in a normal manner, whereas memory cellson odd word lines are only weakly erased. Weak erase is discussed inmore detail below. The erase may be triggered by a variety of events. Inone aspect, SLC blocks are programmed until enough are programmed towarrant transfer to one or more MLC blocks. After the data transfer toMLC blocks, the data in the SLC blocks may be erased. The erase mightalso be triggered by the host sending a command to the controller 244that indicates that all data in the block is to be erased or writtenwith new data. Thus, it is not required that the data from the block betransferred to another block prior to the erase. Selectively erasingmemory cells on certain word lines (e.g., only even word lines) isdiscussed in connection with FIG. 14.

In step 708, memory cells on odd word lines are programmed. However,memory cells on the even word lines remain erased (or unprogrammed).FIG. 8B depicts an example pattern that results after programming memorycells on odd word lines, but keeping memory cells on even word lineserased. In FIG. 8B, for WL0, WL1 and WL3 the memory cells all have a “0”indicating that they are erased. Memory cells on WL1, WL3 and WL5 haveeither a “1” or a “0” indicating that those memory cells have beenprogrammed. For example, some of the memory cells have had theirthreshold voltage changed from the erase threshold voltage (state 0 inFIG. 6A) to a programmed state (e.g., state 1 in FIG. 6A). Note thatprogramming could involve changing the threshold voltage to anotherstate such as any of states 2-15 in FIGS. 6B and 6C.

In step 710, at least the memory cells on the odd word lines are erased.This is similar to step 706 for even word lines and will not bediscussed in detail. Note that it is not required that an alternatingsequence of programming odd, then even, then odd, then even word linesbe maintained. In one aspect, counts are maintained of the number ofprogram/erase cycles for the odd word lines and for the even word lines.Even WLs might be programmed/erased multiple times in succession priorto programming and erasing the odd WLs. Over time, the even and odd wordlines receive the same number of program/erase cycles to level the wear.A count of the program/erase cycles for even and odd WLs might be storedin free memory cells of the data block itself. That is, there may be acertain number of memory cells in each block that are not for user data.Alternatively, the count could be stored elsewhere, such as a differentblock in the memory array 200 or memory outside of the memory array 200.Thus, in one aspect, steps 704 and 706 (program/erase even WLs) may berepeated many times prior to performing steps 708 and 710 (program/eraseodd WLs).

In other embodiments, rather than programming even and odd WLs, someother pattern is used. In one embodiment, the pattern has at least oneunprogrammed WL between each programmed WL. For example, every thirdword line is programmed while the WLs in between remain erased. As aspecific example, WL0, WL3, WL6, WL9, etc., are programmed, whereas WL1,WL2, WL4, WL5, WL7, WL8, etc., remain erased. After erasing WL0, WL3,WL6, WL9, etc., a new group of word lines is selected for programming.For example, WL1, WL4, WL7, WL10, etc., are programmed, whereas WL0,WL2, WL3, WL5, WL6, WL8, etc., remain erased. Other programmingpatterns, such as a checkerboard pattern are discussed below.

By keeping at least one WL unprogrammed next to each programmed WL, atleast some floating gate charge coupling effects may be reduced oreliminated. For example, those effects that might otherwise arise fromprogramming a memory cell on a WL above or below a given WL are greatlyreduced or eliminated. Reducing this WL-WL floating gate charge couplingeffect may increase the endurance of the block. Furthermore, the windowbetween the states (e.g., the erase state and programmed state) may beincreased.

FIG. 7B depicts one embodiment of a flowchart of a process 750 ofprogramming a block of memory cells in a memory array 200. In oneembodiment, process 750 is applied to SLC blocks but not to MLC blocks.However, process 750 may also be applied to MLC blocks. In process 750,WLs receive a normal erase just before they are programmed, whereas WLsthat are not to be programmed at this time are only weakly erased. TheWLs that are not to be programmed at this time are weakly erased as theymay contain data.

In step 752, even word lines receive a normal erase and odd word linesare weakly erased. In one embodiment, a “weak erase” is achieved byapplying a different bias condition to word lines of memory cells to beweakly erased than the bias condition that is normally applied to theerase memory cells. Providing a normal erase to some memory cells (e.g.,only memory cells on even word lines) while weakly erasing memory cellson other word lines is discussed in connection with FIG. 14. In step704, memory cells associated with even word lines are programmed.However, memory cells on the odd word lines are not programmed. Step 704has already been discussed in connection with process 700. In step 756,odd word lines receive a normal erase and even word lines are weaklyerased. In step 708, memory cells on odd word lines are programmedwithout programming memory cells on the even word lines. Process 750then returns to step 752 to erase even WLs (applying the normal biascondition to word lines, for example) and weakly erase odd word lines.Numerous variations of process 750 similar to the variations of process700 are possible.

FIG. 9A depicts one embodiment of a process 900 programming SLC and MLCblocks. For purposes of discussion, an example of programming eight SLCblocks and transferring that data to one MLC block will be discussed. Inthis example, it takes eight SLC blocks that have only even WLs (oralternatively odd WLs) programmed to fill one MLC block, although it maytake more or fewer SLC blocks to fill an MLC block. In step 902, SLCblocks are erased. Thus, at least the eight SLC blocks are erased.

In step 904, the controller 244 receives data to be stored in the memoryarray 200. For example, a host sends the controller 244 user data tostore. In process 900, the controller 244 determines that the datashould first be stored in SLC blocks prior to transferring the data toMLC blocks. As previously discussed, the SLC blocks may be used as atype of cache to temporarily store the data.

In step 906, memory cells on even WLs of at least one of the SLC blocksare programmed with the received data. The controller 244 may programthe data into more than one SLC block. If so, the controller 244 mightprogram only even WLs of one SLC block and only odd WLs of another SLCblock. Thus, it is not a requirement that at one point in time all ofthe SLC blocks have their even WLs programmed. However, for clarity ofdiscussion an example will be used in which even WLs are programmed ineach of the SLC blocks.

In step 908, the controller 244 determines whether enough SLC blocks areprogrammed to warrant transfer of the data to one or more MLC blocks.For example, the controller 244 determines whether all of the even wordlines on eight SLC blocks are programmed. If not, process 900 returns tostep 904 to receive more data to be stored in the memory array at leastuntil enough SLC blocks are programmed to warrant transfer to MLCblocks.

Note that it is not required that the data from the SLC blocks betransferred to an MLC block as soon as possible. The controller 244 maywait until memory access is idle (e.g., the host is not accessing thememory array 200) to transfer data from SLC blocks to one or more MLCblocks. In this case, other SLC blocks are programmed until anappropriate time to transfer the data to MLC blocks.

When the controller 244 determines it is appropriate, data istransferred from the SLC blocks to one or more MLC blocks, in step 910.Note that at this point, the memory cells on the odd WLs have remainederased (unprogrammed). In the present example, the controller 244 readsin the data from the even WLs of the eight SLC blocks, applies ECC tothe data and then stores the data in the MLC block. It is not requiredthat the even/odd programming of WLs be applied to MLC blocks. Thus,memory cells on every WL of an MLC block may be programmed.

In step 912, memory cells in the SLC blocks are erased. Only those WLsthat were programmed need to be erased. For example, all of the even WLsare erased in each of the SLC blocks for which data was transferred. Insome embodiments, the even WLs receive a normal erase, whereas the oddWLs receive a weak erase. Note that it is not a requirement that onlyhalf the word lines are erased. After step 912, process 900 returns tostep 904 to receive more data to be stored in the memory array 200.However, this time data may be stored in odd WLs of each of the SLCblocks.

As with the example of FIG. 7A, it is not required that a stricteven/odd WL pattern be used in process 900. For example, the even wordlines may be programmed many times prior to programming the odd wordlines. A count of the program/erase cycles for the even and for the oddword lines may be maintained to allow for wear leveling. In otherembodiments, rather than programming even and odd WLs some other patternis used. In one embodiment, the pattern has at least one unprogrammed WLbetween each programmed WL. For example, every third word line isprogrammed while the WLs in between remain erased. As a specificexample, WL0, WL3, WL6, etc., are programmed, whereas WL1, WL2, WL4,WL5, WL7, WL8, etc., remain erased. After erasing WL0, WL3, WL6, etc., anew group of word lines is selected for programming. For example, WL1,WL4, WL7, etc., are programmed, whereas WL0, WL2, WL3, WL5, WL6, WL8,etc., remain erased.

FIG. 9B depicts one embodiment of a process 950 programming SLC and MLCblocks. In this embodiment, word lines are erased just prior toprogramming. Word lines that are not to be programmed may be weaklyerased. Process 950 is similar to process 900 and will not be discussedin detail.

In step 904, the controller 244 receives data to be stored in the memoryarray 200. In step 955, even word lines receive a normal erase and oddword lines are weakly erased. In step 906, memory cells on even WLs ofat least one of the SLC blocks are programmed with the received data.The controller 244 may program the data into more than one SLC block. Ifso, the controller 244 might program only even WLs of one SLC block andonly odd WLs of another SLC block. Thus, it is not a requirement that atone point in time all of the SLC blocks have their even WLs programmed.However, for clarity of discussion an example will be used in which evenWLs are programmed in each of the SLC blocks.

In step 908, the controller 244 determines whether enough SLC blocks areprogrammed to warrant transfer of the data to one or more MLC blocks. Ifnot, process 950 returns to step 904 to receive more data to be storedin the memory array at least until enough SLC blocks are programmed towarrant transfer to MLC blocks. When the controller 244 determines it isappropriate, data is transferred from the SLC blocks to one or more MLCblocks, in step 910. Note that at this point, the memory cells on theodd WLs have remained weakly erased (unprogrammed).

In step 962, memory cells associated with odd WLs receive a normal eraseand memory cells associated with even WLs are weakly erased. After step962, process 950 returns to step 904 to receive more data to be storedin the memory array 200. However, this time data may be stored in oddWLs of each of the SLC blocks (with even WLs remaining unprogrammed).

FIG. 10 depicts one embodiment of a flowchart of a process 1000 ofprogramming a block of memory cells of memory array 200 in acheckerboard pattern. In one embodiment, process 1000 is applied to SLCblocks but not to MLC blocks. However, process 1000 may also be appliedto MLC blocks. In step 1002, a block of memory cells in the memory arrayis erased. Details of erasing a block of memory cells are discussedbelow. In step 1002, all memory cells in the block are erased.

In step 1004, memory cells associated with both even bit lines and evenword lines are programmed and memory cells associated with both odd bitlines and odd word lines are programmed. However, other memory cellsremain erased (unprogrammed). FIG. 11A depicts an example pattern thatresults after programming memory cells a checkerboard pattern. Thememory cells that are encircled by dashed lines are those that arecandidates for programming. In FIG. 11A, the memory cells have beenprogrammed to either a “1” or a “0”. However, programming could involvechanging the threshold voltage to another state such as any of states2-15 in FIGS. 6B and 6C. In other words, the block might be an MLCblock. Typically, there will be many more word lines in a block. Forexample, there might be 64 or more word lines. Also, there are typicallymany more bit lines in a block. For example, there might be thousands ofbit lines.

In step 1006, memory cells are erased. Because there may be programmedmemory cells on each word line, erasing may involve applying eraseconditions to all memory cells. For example, even a memory cell that isstill in the erased state may have erase conditions applied thereto.Details of erasing memory cells are discussed with respect to FIG. 14.The erase may be triggered by a variety of events. In one aspect, SLCblocks are programmed until enough are programmed to warrant transfer toone or more MLC blocks. After the data transfer to MLC blocks, the datain the SLC blocks may be erased. The erase might also be triggered bythe host sending a command to the controller 244 that indicates that alldata in the block is to be erased or written with new data. Thus, it isnot required that the data from the block be transferred to anotherblock prior to the erase.

In step 1008, memory cells that were not programmed in step 1004 areprogrammed. In other words, the other portion of the checkerboardpattern is programmed. However, other memory cells (e.g., thoseprogrammed in step 1004) remain erased (or unprogrammed). FIG. 11Bdepicts an example pattern that results. In that pattern, memory cellsassociated with both odd bit lines and even word lines are programmedand memory cells associated with both even bit lines and odd word linesare programmed. In step 1010, the memory cells are erased. The process1000 may then continue on by returning to step 1004.

Note that it is not required that an alternating sequence of programmingone part of the checkerboard (e.g., FIG. 11A) and then the other part ofthe checkerboard (e.g., FIG. 11B) be maintained. In one aspect, countsare maintained of the number of program/erase cycles for each pattern.The pattern in FIG. 11A might be programmed/erased multiple times insuccession prior to programming and erasing the pattern in FIG. 11B.Over time, each pattern receives the same number of program/erase cyclesto level the wear. A count of the program/erase cycles for each patternmight be stored in a free memory cell of the data block itself. That is,there may be a certain number of memory cells in each block that are notfor user data. Alternatively, the count could be stored elsewhere, suchas a different block in the memory array 200 or memory outside of thememory array 200. Thus, in one aspect, steps 1004 and 1006(program/erase even WLs) may be repeated many times prior to performingsteps 1008 and 1010 (program/erase odd WLs).

Note that in FIGS. 11A and 11B, the neighbor memory cell on the WL aboveand below each programmed memory cell is not programmed. This is alsotrue for the embodiment depicted in FIGS. 8A and 8B. Therefore, floatinggate charge coupling effects that might otherwise arise from programminga memory cell on a WL above or below a given WL are greatly reduced oreliminated. Moreover, the neighbor memory cell on the bit line (BL) tothe right and to the left of each programmed memory cell is notprogrammed. Therefore, floating gate charge coupling effects that mightotherwise arise from programming a memory cell on a BL to the right andto the left are also greatly reduced or eliminated. The only remainingfloating gate charge coupling effect is in the diagonal direction, forexample, a programmed memory cell on BLn of WLn may still be affected bythe programmed cells on BLn−1 and BLn+1 of WLn+1 and/or WLn−1.

In the checkerboard pattern of FIGS. 11A and 11B, memory cells on everyother bit line (BL) for a given word line are programmed. However, it isnot required that the checkerboard pattern be this dense. In order toeliminate the above described diagonal coupling effect, in oneembodiment, memory cells on every fourth bit line for a given word lineare programmed. In another embodiment, it is not required that everyword line is programmed. For example, every fourth word line might beprogrammed. As an example, for WL0, the memory cells on BL0, BL4, BL8,etc. are programmed. WL1 and WL2 might remain unprogrammed. For WL3, thememory cells on BL2, BL6, BL10, etc. are programmed. Other checkerboardpatterns may be used. For example each WL might be programmed, but onlyevery fourth memory cell on each word line. In this case, the neighbormemory cell above and below a programmed memory cell remainsunprogrammed. In yet another embodiment, every other wordline and everyother bitline is programmed. For example, for WL0, the memory cells onBL0, BL2, BL4, etc. are programmed, WL1 remains unprogrammed. For WL2,BL0, BL2, BL4, etc. are programmed, or BL1, BL3, BL5, etc areprogrammed. In both latter cases, each programmed memory cell never hasa neighboring cell that is in a programmed state, not even in thediagonal direction and thus floating gate charge coupling effects arealmost completely eliminated.

FIG. 12 depicts one embodiment of a process 1200 programming SLC and MLCblocks using a checkerboard pattern for SLC blocks. For purposes ofdiscussion, an example of programming eight SLC blocks and transferringthat data to one MLC block will be discussed. In this example, it takeseight SLC blocks programmed in a checkerboard pattern to fill one MLCblock, although it may take more or fewer SLC blocks to fill an MLCblock. In step 1202, SLC blocks are erased. Thus, at least the eight SLCblocks are erased.

In step 1204, the controller 244 receives data to be stored in thememory array 200. For example, a host sends the controller 244 user datato store. In process 1200, the controller 244 determines that the datashould first be stored in SLC blocks prior to transferring the data toMLC blocks. As previously discussed, the SLC blocks may be used as atype of cache to temporarily store the data.

In step 1206, the controller 244 programs data in a checkerboardpattern. The controller 244 may program the data into more than one SLCblock. If so, the controller 244 might use one checkerboard pattern(e.g., FIG. 11A) for one SLC block and another checkerboard pattern(e.g., FIG. 11B) for another SLC block. However, for clarity ofdiscussion an example will be used in which the pattern of FIG. 11A isprogrammed in each of the SLC blocks.

In step 1208, the controller 244 determines whether enough SLC blocksare programmed in the checkerboard pattern to warrant transfer of thedata to one or more MLC blocks. For example, the controller 244determines that all eight SLC blocks are programmed. If not, process1200 returns to step 1204 to receive more data to be stored in thememory array at least until enough SLC blocks are programmed to warranttransfer to MLC blocks.

Note that it is not required that the data from the SLC blocks betransferred to an MLC block as soon as possible. The controller 244 maywait until memory access is idle (e.g., the host is not accessing thememory array 200) to transfer data from SLC blocks to one or more MLCblocks. In this case, other SLC blocks are programmed until anappropriate time to transfer the data to MLC blocks.

When the controller 244 determines it is appropriate, data istransferred from the SLC blocks to one or more MLC blocks, in step 1210.Note that at this point, some of the memory cells remained erased. Forexample, those memory cells that are not encircled in dashed line inFIG. 11A are still erased. In the present example, the controller 244reads in the data from all of the WLs of the eight SLC blocks, anddiscards the data from the memory cells that were not programmed,applies ECC to the data and then stores the data in the MLC block. It isnot required that the checkerboard programming be applied to MLC blocks.Thus, all memory cells of an MLC block may be programmed.

In step 1212, memory cells in the SLC blocks are erased. After step1212, process 1200 returns to step 1204 to receive more data to bestored in the memory array 200. However, this time data may be stored ina different checkerboard pattern (e.g., FIG. 11B). A count of theprogram/erase cycles for each checkerboard pattern (e.g., FIG. 11A, FIG.11B) may be maintained to allow for wear leveling. Note that it is notrequired that the checkerboard patterns depicted in FIGS. 11A and 11B beused for process 1200. For example, a less dense checkerboard pattern inwhich memory cells on every fourth bit line (and possibly every fourthword line) are programmed may be used.

FIG. 13 is a flow chart describing details of programming memory cells.The process 1300 of FIG. 13 can be performed in response to receiving arequest to program data. Process 1300 describes programming one wordline and may be repeated for each word line in a block. The phrase“programming a word line” means to program memory cells associated witha word line. In some embodiments, all memory cells on the word line areprogrammed at the same time. That is, memory cells associated with allbit lines (and a certain word line) are programmed together. In someembodiments, memory cells associated with odd bit lines are programmedseparately from memory cells associated with even bit lines.

If used to perform step 704 of process 700, process 1300 may beperformed once for each even word line in a block. If used to performstep 708, process 1300 may be performed once for each odd word line in ablock. If used to perform either step 1004 or 1008 of process 1000,process 1300 may be performed once for each word line in a block withsome word lines having odd bit lines programmed and others having evenbit lines programmed. Programming memory cells on only selected bitlines may be achieved by locking out programming on certain bit lines.

The order in which even (or odd) word lines are programmed is notlimited to a particular sequence. One example sequence is to programWL0, WL2, WL4, etc., until each even word line has been programmed. Forodd word lines, the sequence may be to program WL1, WL3, WL5, etc. untileach odd word line has been programmed. The word lines could beprogrammed in the opposite order (i.e., high to low). Also, it is notrequired that the word lines be programmed in sequence. For example, WL2might be programmed after programming WL0 and WL4.

In step 1312, the system will set the magnitude of the initial programpulse. At step 1314, the program count PC will be set to initially bezero. In step 1316, a program pulse is applied to the appropriate wordline(s). In step 1318, the memory cells to be programmed on that wordline(s) are verified to see if they have reached the target thresholdvoltage level. If all or almost all of the memory cells to be programmedhave reached the target threshold voltage level (step 1320), then theprogramming process has completed successfully (status=pass) in step1322. If not all or almost all of the memory cells have been verified,then it is determined in step 1324 whether the program count PC is lessthan 20. If the program count is not less than 20, then the programmingprocess has failed (step 1326). If the program count is less than 20,than in step 1328, the magnitude of program voltage signal Vpgm isincremented by the step size (e.g., 0.3V) for the next pulse and theprogram count PC is incremented. Note that those memory cells that havereached their target threshold voltage are locked out of programming forthe remainder of the current programming cycle. After step 1328, theprocess of FIG. 13 continues at step 1316 and the next program pulse isapplied. Note that another number than 20 for the program count PC canbe used as stop criteria in 1324. Also not that in 1320, in someembodiments, it is not necessary that all memory cells reach the targetthreshold voltage. Since ECC is applied, a certain number of memorycells that do not reach the target threshold voltage level can betolerated as these can be corrected by the ECC.

FIG. 14 is a flow chart describing a process 1400 for erasing memorycells. In some embodiments, some memory cells are only weakly erased andothers receive a normal erase. In other embodiments, all memory cellsreceive a normal erase. In some embodiments, different bias conditionsare applied to WLs to achieve either a normal or weak erase. The weakerase reduces stress on the memory cells.

Process 1400 is one implementation of steps 702, 706, or 710 of process700; steps 752 or 756 of process 750; steps 902 or 912 of process 900;steps 955 or 962 of process 950; steps 1002, 1006, or 1010 of process1000, or steps 1202 or 1212 of process 1200. In step 1402, the systemwill set the magnitude of the initial erase pulse. At step 1404, anerase loop count will be set to initially be zero. In step 1406, biasconditions are applied to word lines. In one embodiment different biasconditions are applied to word lines having memory cells to receive anormal erase than to word lines for which memory cells are to be weaklyerased. For example, 0V may be used for word lines to receive a normalerase and a positive voltage may be applied to word lines to be weaklyerased. As an example, the positive voltage may be a few volts (e.g.,1-4 Volts). In some embodiments, even and odd word lines have differentvoltages applied thereto. When discussing process 1400, the term “aselected word line” refers to a word line whose memory cells are toreceive a normal erase and the term “a selected memory cell” refers to amemory cell to receive a normal erase. The term “an unselected wordline” refers to a word line whose memory cells is to be weakly erased.Note that depending on the magnitude of the positive voltage on theunselected word lines, some erase may still occur, however, this “weakerase” is in general not significant and does not contribute to thedegradation of the memory cell's characteristics.

In step 1408, erase conditions are applied. In one implementation, step1402 includes raising the p-well to an erase voltage (e.g., 20 volts)for a sufficient period of time, grounding the selected word lines of aselected block and applying a bias voltage to unselected word lines ofthe selected block, while the source and bit lines are floating. Due tocapacitive coupling, bit lines, select lines, and the common source lineare also raised to a significant fraction of the erase voltage. A strongelectric field is thus applied to the tunnel oxide layers of selectedmemory cells and the data of the selected memory cells are erased aselectrons of the floating gates are emitted to the substrate side,typically by Fowler-Nordheim tunneling mechanism. As electrons aretransferred from the floating gate to the p-well region, the thresholdvoltage of a selected cell is lowered. Erasing can be performed on theentire memory array, on individual blocks, or another unit of cells.

However, by applying a bias voltage to the unselected word lines, theunselected memory cells do not experience such a strong electric fieldacross their tunnel oxide layers. Therefore, the unselected memory cellsdo not suffer significant stress from erase. In some embodiments, theunselected memory cells are already erased, therefore their thresholdvoltage will not be significantly altered by the above mentioned “weakerase” bias condition. However, note that it is not an absoluterequirement that the unselected memory cells be in the erased stateprior to beginning process 1400. That is, the unselected memory cellsmay contain data prior to being weakly erased.

In step 1410, a set of erase verify conditions are applied to the memorycells. This is a selective erase verify in some embodiments. Note thatthe verify conditions may be different for selected and unselected wordlines because some memory cells may be assumed to be erased even priorto the erase process. For example, if it is assumed that the unselectedmemory cells in the selected block are already erased because they werenot programmed since the last complete erase, then a read pass voltagemay be applied to unselected WLs.

In one implementation, step 1410 includes discharging bit lines toground, Then, a higher than zero voltage (e.g., 2.2V) is applied to thecommon source line and a certain voltage (e.g., 0V) is applied to theselected word lines and another voltage (e.g., Vread) is applied tounselected word lines. Vread may be a voltage that is sufficiently highsuch that the memory cells will conduct a current. Charge builds up onthe bit line, resulting in an increase of the bit line voltage of agiven NAND string until the body effect turns off at least one memorycell in the NAND string.

In step 1412, each of the NAND strings is sensed to determine whetherthe memory cells on the NAND string were sufficiently erased. Step 1406is performed after waiting for a predetermined period of time for thecharge to build up on the bit line. In one implementation, the voltageon a given bit line is compared to a reference value to determinewhether any of the memory cells on the corresponding NAND string have aVt that is above the target value. The target value could be a negativevalue.

In one embodiment, if it is detected that the Vt of each memory cell ona NAND string has reached the target level, then the data stored in thecorresponding data latch is changed to a logic “1.” If it is detectedthat the NAND string has at least one memory cell with a Vt that has notreached the appropriate target level, the data stored in thecorresponding data latch is not changed.

In step 1414, a determination is made as to whether enough NAND stringspassed erase verification. In one implementation, a certain number ofNAND strings are allowed to fail erase verification. For example,providing that fewer than 32 NAND strings failed erase verification, theoverall erase verification passes. If erase passed, then the erasestatus is set to pass and process 1400 ends.

If, at step 1414, it is determined that erase verification failed, thenthe loop count is checked (step 1416) to determine whether it is over alimit. If so, the erase status is set to fail and process 1400 ends. Ifthe loop count is not over the limit, then the erase voltage isincreased in step 1418. The erase voltage can be increased by anydesired amount such as 0.2 V, 0.5 V, 1.0 V, etc. The loop count isincremented. The new erase voltage is applied in step 1408.

As disclosed herein, one embodiment is a method of operatingnon-volatile storage having a group of plurality of non-volatile storageelements and a plurality of word lines. The method comprises erasing aplurality of non-volatile storage elements and programming data in afirst group of the plurality of non-volatile storage elements whileleaving unprogrammed a second group of the plurality of non-volatilestorage elements. For every non-volatile storage element in the firstgroup any neighbor non-volatile storage element on a word line eitherabove or below the non-volatile storage element in the first group is amember of the second group that remains unprogrammed. The data in atleast the first group of non-volatile storage elements are erased whilethe second group of non-volatile storage elements remain unprogrammed.Later, the at least a portion of the second group of the non-volatilestorage elements are programmed. For every non-volatile storage elementin the portion of the second group any neighbor non-volatile storageelement on a word line either above or below the non-volatile storageelement in the portion of the second group remains unprogrammed. Later,data in the at least a portion of the second group of non-volatilestorage elements is erased while the neighbor non-volatile storageelements on word lines either above or below the non-volatile storageelement in the at least the portion of the second group remainunprogrammed.

In one embodiment, the plurality of non-volatile storage elementsdiscussed in the preceding paragraph are part of a block in which datais stored one bit per non-volatile storage element and the non-volatilestorage further includes multi-level blocks of non-volatile storageelements in which more than one bit of data is stored per non-volatilestorage element. In one embodiment, the method further comprisestransferring the data programmed in the first group of non-volatilestorage elements associated to one or more of the multi-level blockswhile the second group of non-volatile storage elements remainunprogrammed, and transferring the data programmed in the at least theportion of the second group of non-volatile storage elements to one ormore of the multi-level blocks while the neighbor non-volatile storageelements on word lines either above or below the non-volatile storageelements in the at least the portion of the second group remainunprogrammed.

In another embodiment, the first group of non-volatile storage elementsare non-volatile storage elements associated with both an even bit lineof the plurality of bit lines and an even word line of the pluralityword lines and both an odd bit line of the plurality of bit lines and anodd word line of the plurality word lines. The second group arenon-volatile storage elements associated with both an odd bit line andan even word line and both an even bit line and an odd word line. Inthis case, the at least a portion of the second group is the entiresecond group.

One embodiment is a method of operating non-volatile storage having aplurality of non-volatile storage elements and a plurality of word linesassociated with the plurality of non-volatile storage elements. Themethod includes erasing data in a first group of non-volatile storageelements, weakly erasing a second group of the non-volatile storageelements, programming data in the first group of the plurality ofnon-volatile storage elements while not programming the second group ofthe plurality of non-volatile storage elements; for every non-volatilestorage element in the first group any neighbor non-volatile storageelement on a word line either above or below the non-volatile storageelement in the first group is a member of the second group that is notprogrammed. The method further includes erasing data in the second groupof non-volatile storage elements at a time when the second group ofnon-volatile storage elements are still weakly erased, and programmingthe second group of the non-volatile storage elements; for everynon-volatile storage element in the second group any neighbornon-volatile storage element on a word line either above or below thenon-volatile storage element in the second group is not programmed.

One embodiment is a method of operating non-volatile storage comprisingsingle-level blocks and multi-level blocks. The single-level blocks eachinclude a plurality of word lines and a plurality of non-volatilestorage elements. The method includes erasing non-volatile storageelements associated with all word lines in a first single-level block ofthe single-level blocks, and programming data in non-volatile storageelements in a checkerboard pattern in the first single-level block. Atleast half of the non-volatile storage elements in the firstsingle-level block remain unprogrammed. The data from the non-volatilestorage elements associated with the checkerboard pattern in the firstsingle-level block is transferred to a first multi-level block of themulti-level blocks while the at least half of the non-volatile storageelements in the first single-level block remain erased. The data in thefirst single-level block is erased.

In a further embodiment, the checkerboard pattern of the previousparagraph is a first checkerboard pattern. Data is programmed innon-volatile storage elements in a second checkerboard pattern in thefirst single-level block. Non-volatile storage elements that wereprogrammed in the first checkerboard pattern are not programmed usingthe second checkerboard pattern. The data from the non-volatile storageelements associated with the second checkerboard pattern in the firstsingle-level block is transferred to a second multi-level block of themulti-level blocks while non-volatile storage elements associated withthe first checkerboard pattern line remain erased.

One embodiment is a non-volatile storage device comprising a pluralityof non-volatile storage elements, a plurality of word lines associatedwith the group of non-volatile storage elements. and one or moremanaging circuits in communication with the non-volatile storageelements. The one or more managing circuits erase the non-volatilestorage elements. The one or more managing circuits program data in afirst group of the plurality of non-volatile storage elements whileleaving unprogrammed a second group of the plurality of non-volatilestorage elements. For every non-volatile storage element in the firstgroup any neighbor non-volatile storage element on a word line eitherabove or below the non-volatile storage element in the first group is amember of the second group that remains unprogrammed. The one or moremanaging circuits erase the data in the first group of non-volatilestorage elements while the second group of non-volatile storage elementsremain unprogrammed. The one or more managing circuits program at leasta portion of the second group of the non-volatile storage elements. Forevery non-volatile storage element in the portion of the second groupany neighbor non-volatile storage element on a word line either above orbelow non-volatile storage element in the portion of the second groupremains unprogrammed. The one or more managing circuits erase data inthe at least the portion of the second group of non-volatile storageelements while the neighbor non-volatile storage elements on word lineseither above or below the non-volatile storage element in the at leastthe portion of the second group remain unprogrammed.

In one embodiment, the non-volatile storage device of the previousparagraph further includes a plurality of bit lines associated with thegroup of non-volatile storage elements. The first group includesnon-volatile storage elements associated with both an even bit line ofthe plurality of bit lines and an even word line of the plurality ofword lines and both an odd bit line of the plurality of bit lines and anodd word line of the plurality of bit lines. The second group includesnon-volatile storage elements associated with both an odd bit line andan even word line and both an even bit line and an odd word line. Inanother embodiment, the first group includes non-volatile storageelements associated with even word lines of the plurality of word linesand the second group includes non-volatile storage elements associatedwith odd word lines of the plurality of word lines.

One embodiment is a non-volatile storage device including a first groupof non-volatile storage elements, a second group of non-volatile storageelements, a plurality of word lines associated with the first group ofnon-volatile storage elements, and one or more managing circuits incommunication with the first group of non-volatile storage elements andthe second group of non-volatile storage elements. The one or moremanaging circuits store one bit of data per non-volatile storage elementin the first group. The one or more managing circuits store multiplebits of data per non-volatile storage element in the second group. Theone or more managing circuits erase the first group of non-volatilestorage elements. The one or more managing circuits program data innon-volatile storage elements associated with even word lines of theplurality word lines while leaving non-volatile storage elementsassociated with odd word lines of the plurality word lines erased. Theone or more managing circuits transfer the data from the non-volatilestorage elements associated with the even word lines to the second groupof non-volatile storage elements while non-volatile storage elementsassociated with the odd word lines remain erased. The one or moremanaging circuits erase the data in the non-volatile storage elementsassociated with the even word lines.

One embodiment is a non-volatile storage device comprising a first groupof non-volatile storage elements, a second group of non-volatile storageelements, a plurality of word lines associated with the first group ofnon-volatile storage elements, a plurality of bit lines associated withthe first group of non-volatile storage elements, and one or moremanaging circuits in communication with the first group of non-volatilestorage elements and the second group of non-volatile storage elements.The one or more managing circuits store one bit of data per non-volatilestorage element in the first group. The one or more managing circuitsstore multiple bits of data per non-volatile storage element in thesecond group. The one or more managing circuits erase the first group ofnon-volatile storage elements. The one or more managing circuits programdata in non-volatile storage elements in a checkerboard pattern in thefirst group. At least half of the non-volatile storage elements in thefirst group remain unprogrammed. The one or more managing circuitstransfer the data from the non-volatile storage elements associated withthe checkerboard pattern in the first group to a first subset ofnon-volatile storage element in the second group of non-volatile storageelements. The one or more managing circuits erase the data in the firstgroup.

In a further embodiment, the checkerboard pattern of the previousparagraph is a first checkerboard pattern. The one or more managingcircuits program data in non-volatile storage elements in a secondcheckerboard pattern in the first group. Non-volatile storage elementsthat were programmed in the first checkerboard pattern are notprogrammed using the second checkerboard pattern. The one or moremanaging circuits transfer the data from the non-volatile storageelements associated with the second checkerboard pattern in the firstgroup to a second subset of non-volatile storage elements in the secondgroup while non-volatile storage elements associated with the firstcheckerboard pattern remain erased.

Yet another embodiment is a method of operating non-volatile storagecomprising single-level blocks and multi-level blocks. The single-levelblocks each include a plurality of word lines and a plurality ofnon-volatile storage elements. The method includes: performing a normalerase of non-volatile storage elements associated with even word linesin a first block of the single-level blocks while weakly erasingnon-volatile storage elements associated with odd word lines in thefirst block, programming data in non-volatile storage elementsassociated with the even word lines of the plurality of word lines.Non-volatile storage elements associated with the odd word lines are notprogrammed. Data from the non-volatile storage elements associated withthe even word lines is transferred to one or more of the multi-levelcell blocks, the transferring occurs while non-volatile storage elementsassociated with the odd word lines remain weakly erased.

The foregoing detailed description of the invention has been presentedfor purposes of illustration and description. It is not intended to beexhaustive or to limit the invention to the precise form disclosed. Manymodifications and variations are possible in light of the aboveteaching. The described embodiments were chosen in order to best explainthe principles of the invention and its practical application, tothereby enable others skilled in the art to best utilize the inventionin various embodiments and with various modifications as are suited tothe particular use contemplated. It is intended that the scope of theinvention be defined by the claims appended hereto.

1. A method of operating non-volatile storage having a plurality ofnon-volatile storage elements and a plurality of word lines associatedwith the plurality of non-volatile storage elements, the methodcomprising: erasing the plurality of non-volatile storage elements;programming data in a first group of the plurality of non-volatilestorage elements while leaving unprogrammed a second group of theplurality of non-volatile storage elements, for every non-volatilestorage element in the first group any neighbor non-volatile storageelement on a word line either above or below the non-volatile storageelement in the first group is a member of the second group that remainsunprogrammed; erasing the data in at least the first group ofnon-volatile storage elements while the second group of non-volatilestorage elements remain unprogrammed; programming at least a portion ofthe second group of the non-volatile storage elements, for everynon-volatile storage element in the portion of the second group anyneighbor non-volatile storage element on a word line either above orbelow the non-volatile storage element in the portion of the secondgroup remains unprogrammed; and erasing data in the at least a portionof the second group of non-volatile storage elements while the neighbornon-volatile storage elements on word lines either above or below thenon-volatile storage element in the at least the portion of the secondgroup remain unprogrammed.
 2. The method of claim 1, wherein theplurality of non-volatile storage elements are part of a block in whichdata is stored one bit per non-volatile storage element and thenon-volatile storage further includes multi-level blocks of non-volatilestorage elements in which more than one bit of data is stored pernon-volatile storage element.
 3. The method of claim 2 furthercomprising: transferring the data programmed in the first group ofnon-volatile storage elements to one or more of the multi-level blockswhile the second group of non-volatile storage elements associatedremain unprogrammed; and transferring the data programmed in the atleast the portion of the second group of non-volatile storage elementsto one or more of the multi-level blocks while the neighbor non-volatilestorage elements on word lines either above or below the non-volatilestorage element in the at least the portion of the second group remainunprogrammed.
 4. The method of claim 1 wherein the first group arenon-volatile storage elements associated with both an even bit line ofthe plurality of bit lines and an even word line of the plurality wordlines and both an odd bit line of the plurality of bit lines and an oddword line of the plurality word lines, the second group are non-volatilestorage elements associated with both an odd bit line and an even wordline and both an even bit line and an odd word line, the at least aportion of the second group is the entire second group.
 5. The method ofclaim 1 wherein the first group are non-volatile storage elementsassociated with even word lines of the plurality of word lines and thesecond group are non-volatile storage elements associated with odd wordlines of the plurality of word lines, the at least a portion of thesecond group is the entire second group.
 6. The method of claim 5wherein the erasing the data in the first group of non-volatile storageelements while the second group of non-volatile storage elements remainunprogrammed includes performing a selective erase of the non-volatilestorage elements associated with the even word lines without erasing thenon-volatile storage elements associated with the odd word lines.
 7. Themethod of claim 6 wherein the performing a selective erase of thenon-volatile storage elements associated with the even word lineswithout erasing the non-volatile storage elements associated with theodd word lines includes performing an erase verify of the even wordlines without performing an erase verify of the odd word lines.
 8. Themethod of claim 7 wherein the erasing the data in the first group ofnon-volatile storage elements while the second group of non-volatilestorage elements remain unprogrammed includes: applying one or moreerase pulses to the plurality of non-volatile storage elements; applyinga first voltage to the odd word lines while applying the one or moreerase pulses; and applying a second voltage to the even word lines whileapplying the one or more erase pulses, the first voltage is higher thanthe second voltage.
 9. The method of claim 8 wherein the applying asecond voltage to the even word lines while applying the one or moreerase pulses creates a strong electric field across tunnel oxide layersof the non-volatile storage elements associated with the even wordlines, the applying a first voltage to the odd word lines while applyingthe one or more erase pulses creates a less strong electric field acrosstunnel oxide layers of the non-volatile storage elements associated withthe odd word lines.
 10. The method of claim 9 wherein the erasing thedata in the first group of non-volatile storage elements while thesecond group of non-volatile storage elements remain unprogrammedincludes performing an erase verify by: applying a read voltage to theodd word lines; applying a voltage that is less than the read voltage tothe even word lines; sensing a condition of a bit line associated with afirst subset of non-volatile storage elements of the plurality ofnon-volatile storage elements; and determining whether the first subsetof non-volatile storage elements are erased based on the condition. 11.The method of claim 1 wherein the programming data in a first groupincludes programming non-volatile storage elements that are associatedwith every third word line of the plurality of word lines.
 12. A methodof operating non-volatile storage having a plurality of non-volatilestorage elements and a plurality of word lines associated with theplurality of non-volatile storage elements, the method comprising:erasing data in a first group of non-volatile storage elements; weaklyerasing a second group of the non-volatile storage elements; programmingdata in the first group of the plurality of non-volatile storageelements while not programming the second group of the plurality ofnon-volatile storage elements, for every non-volatile storage element inthe first group any neighbor non-volatile storage element on a word lineeither above or below the non-volatile storage element in the firstgroup is a member of the second group that is not programmed; erasingdata in the second group of non-volatile storage elements at a time whenthe second group of non-volatile storage elements are still weaklyerased; programming the second group of the non-volatile storageelements, for every non-volatile storage element in the second group anyneighbor non-volatile storage element on a word line either above orbelow the non-volatile storage element in the second group is notprogrammed.
 13. A method of operating non-volatile storage comprisingsingle-level blocks and multi-level blocks, the single-level blocks eachincluding a plurality of word lines and a plurality of non-volatilestorage elements, said method comprising: erasing non-volatile storageelements associated with all word lines in a first block of thesingle-level blocks; programming data in non-volatile storage elementsassociated with even word lines of the plurality word lines,non-volatile storage elements associated with odd word lines of theplurality of word lines are unprogrammed; transferring the data from thenon-volatile storage elements associated with the even word lines to oneor more of the multi-level cell blocks, the transferring occurs whilenon-volatile storage elements associated with the odd word lines remainunprogrammed; erasing the data in the non-volatile storage elementsassociated with the even word lines.
 14. The method of claim 13, furthercomprising: programming data in the non-volatile storage elementsassociated with the odd word lines after erasing the data in thenon-volatile storage elements associated with the even word lines;transferring the data from the non-volatile storage elements associatedwith the odd word lines to one or more of the multi-level blocks, thetransferring occurs while non-volatile storage elements associated withthe even word lines remain erased; and erasing the data in thenon-volatile storage elements associated with the odd word lines. 15.The method of claim 13, wherein the erasing the data in the non-volatilestorage elements associated with the even word lines includes erasingonly the non-volatile storage elements associated with the even wordlines.
 16. The method of claim 15, wherein the erasing only thenon-volatile storage elements associated with the even word linesincludes: applying one or more erase pulses to the non-volatile storageelements in the first block; applying a first voltage to the even wordlines while applying the one or more erase pulses; and applying a secondvoltage to the odd word lines in the first block while applying the oneor more erase pulses, the first voltage is lower than the secondvoltage.
 17. A method of operating non-volatile storage comprisingsingle-level blocks and multi-level blocks, the single-level blocks eachincluding a plurality of word lines and a plurality of non-volatilestorage elements, said method comprising: erasing non-volatile storageelements associated with all word lines in at least a first single-levelblock of the single-level blocks; programming data in non-volatilestorage elements in a checkerboard pattern in the at least firstsingle-level block, at least half of the non-volatile storage elementsin the at least first single-level block remain erased; transferring thedata from the non-volatile storage elements associated with thecheckerboard pattern in the at least first single-level block to a firstmulti-level block of the multi-level blocks while the at least half ofthe non-volatile storage elements in the at least first single-levelblock remain erased; and erasing the data in the at least firstsingle-level block.
 18. The method of claim 17, wherein the checkerboardpattern is a first checkerboard pattern and further comprising:programming data in non-volatile storage elements in a secondcheckerboard pattern in the first single-level block, non-volatilestorage elements that were programmed in the first checkerboard patternare not programmed using the second checkerboard pattern; andtransferring the data from the non-volatile storage elements associatedwith the second checkerboard pattern in the first single-level block toa second multi-level block of the multi-level blocks, the transferringoccurs while non-volatile storage elements associated with the firstcheckerboard pattern line remain erased.
 19. The method of claim 18,wherein the second multi-level block and the first multi-level block arethe same multi-level block.
 20. A non-volatile storage devicecomprising: a plurality of non-volatile storage elements; a plurality ofword lines associated with the group of non-volatile storage elements;and one or more managing circuits in communication with the non-volatilestorage elements, the one or more managing circuits erase thenon-volatile storage elements, the one or more managing circuits programdata in a first group of the plurality of non-volatile storage elementswhile leaving unprogrammed a second group of the plurality ofnon-volatile storage elements, for every non-volatile storage element inthe first group any neighbor non-volatile storage element on a word lineeither above or below the non-volatile storage element in the firstgroup is a member of the second group that remains unprogrammed, the oneor more managing circuits erase the data in the first group ofnon-volatile storage elements while the second group of non-volatilestorage elements remain unprogrammed, the one or more managing circuitsprogram at least a portion of the second group of the non-volatilestorage elements, for every non-volatile storage element in the portionof the second group any neighbor non-volatile storage element on a wordline either above or below non-volatile storage element in the portionof the second group remains unprogrammed, the one or more managingcircuits erase data in the at least the portion of the second group ofnon-volatile storage elements while the neighbor non-volatile storageelements on word lines either above or below the non-volatile storageelement in the at least the portion of the second group remainunprogrammed.
 21. The non-volatile storage device of claim 20, furthercomprising a plurality of bit lines associated with the group ofnon-volatile storage elements, the first group includes non-volatilestorage elements associated with both an even bit line of the pluralityof bit lines and an even word line of the plurality of word lines andboth an odd bit line of the plurality of bit lines and an odd word lineof the plurality of bit lines, the second group includes non-volatilestorage elements associated with both an odd bit line and an even wordline and both an even bit line and an odd word line.
 22. Thenon-volatile storage device of claim 20, wherein the first groupincludes non-volatile storage elements associated with even word linesof the plurality of word lines and the second group includesnon-volatile storage elements associated with odd word lines of theplurality of word lines.
 23. The non-volatile storage device of claim22, wherein the one or more managing circuits perform a selective eraseof the non-volatile storage elements associated with the even word lineswithout erasing the non-volatile storage elements associated with theodd word lines.
 24. The non-volatile storage device of claim 23, whereinthe one or more managing circuits performing a selective erase of thenon-volatile storage elements associated with the even word linesincludes the one or more managing circuits applying one or more erasepulses to the plurality of non-volatile storage elements, the one ormore managing circuits apply a first voltage to the odd word lines whileapplying the one or more erase pulses, the one or more managing circuitsapply a second voltage to the even word lines while applying the one ormore erase pulses, the first voltage is higher than the second voltage.25. The non-volatile storage device of claim 24, wherein the one or moremanaging circuits applying a second voltage to the even word lines whileapplying the one or more erase pulses creates a strong electric fieldacross tunnel oxide layers of the non-volatile storage elementsassociated with the even word lines, the one or more managing circuitsapplying a first voltage to the odd word lines while applying the one ormore erase pulses does create a less strong electric field across tunneloxide layers of the non-volatile storage elements associated with theodd word lines.
 26. The non-volatile storage device of claim 20, whereinthe one or more managing circuits perform wear-leveling based on thenumber of times that the first group of non-volatile storage elementshave been erased and the number of times that the second group ofnon-volatile storage elements have been erased.
 27. The non-volatilestorage device of claim 20, wherein the plurality of non-volatilestorage elements are part of a block in which data is stored one bit pernon-volatile storage element and the non-volatile storage furtherincludes multi-level blocks of non-volatile storage elements in whichmore than one bit of data is stored per non-volatile storage element.28. The non-volatile storage device of claim 27, wherein the one or moremanaging circuits transfer the data programmed in the first group ofnon-volatile storage elements associated to one or more of themulti-level blocks while the second group of non-volatile storageelements associated remain unprogrammed, the one or more managingcircuits transfer the data programmed in the at least the portion of thesecond group of non-volatile storage elements to one or more of themulti-level blocks while the neighbor non-volatile storage elements onword lines either above or below the non-volatile storage element in theat least the portion of the second group remain unprogrammed.
 29. Anon-volatile storage device comprising: a first group of non-volatilestorage elements; a second group of non-volatile storage elements; aplurality of word lines associated with the first group of non-volatilestorage elements; and one or more managing circuits in communicationwith the first group of non-volatile storage elements and the secondgroup of non-volatile storage elements, the one or more managingcircuits store one bit of data per non-volatile storage element in thefirst group, the one or more managing circuits store multiple bits ofdata per non-volatile storage element in the second group, the one ormore managing circuits erase the first group of non-volatile storageelements, the one or more managing circuits program data in non-volatilestorage elements associated with even word lines of the plurality ofword lines while leaving non-volatile storage elements associated withodd word lines of the plurality of word lines erased, the one or moremanaging circuits transfer the data from the non-volatile storageelements associated with the even word lines to the second group ofnon-volatile storage elements while non-volatile storage elementsassociated with the odd word lines remain erased, the one or moremanaging circuits erase the data in the non-volatile storage elementsassociated with the even word lines.
 30. The non-volatile storage deviceof claim 29, wherein the one or more managing circuits program data inthe non-volatile storage elements associated with the odd word linesafter erasing the data in the non-volatile storage elements associatedwith the even word lines, the one or more managing circuits transfer thedata from the non-volatile storage elements associated with the odd wordlines to the second group of non-volatile storage elements whilenon-volatile storage elements associated with the even word lines remainerased, the one or more managing circuits erase the data in thenon-volatile storage elements associated with the odd word lines.
 31. Anon-volatile storage device comprising: a first group of non-volatilestorage elements; a second group of non-volatile storage elements; aplurality of word lines associated with the first group of non-volatilestorage elements; a plurality of bit lines associated with the firstgroup of non-volatile storage elements; and one or more managingcircuits in communication with the first group of non-volatile storageelements and the second group of non-volatile storage elements, the oneor more managing circuits store one bit of data per non-volatile storageelement in the first group, the one or more managing circuits storemultiple bits of data per non-volatile storage element in the secondgroup, the one or more managing circuits erase the first group ofnon-volatile storage elements, the one or more managing circuits programdata in non-volatile storage elements in a checkerboard pattern in thefirst group, at least half of the non-volatile storage elements in thefirst group remain unprogrammed, the one or more managing circuitstransfer the data from the non-volatile storage elements associated withthe checkerboard pattern in the first group to a first subset ofnon-volatile storage element in the second group of non-volatile storageelements while the at least half of the non-volatile storage elements inthe first group remain unprogrammed, the one or more managing circuitserase the data in the first group.
 32. The non-volatile storage deviceof claim 31, wherein the checkerboard pattern is a first checkerboardpattern, the one or more managing circuits program data in non-volatilestorage elements in a second checkerboard pattern in the first group,non-volatile storage elements that were programmed in the firstcheckerboard pattern are not programmed using the second checkerboardpattern, the one or more managing circuits transfer the data from thenon-volatile storage elements associated with the second checkerboardpattern in the first group to a second subset of non-volatile storageelements in the second group while non-volatile storage elementsassociated with the first checkerboard pattern remain erased.
 33. Amethod of operating non-volatile storage comprising single-level blocksand multi-level blocks, the single-level blocks each including aplurality of word lines and a plurality of non-volatile storageelements, the method comprising: performing a normal erase ofnon-volatile storage elements associated with even word lines in a firstblock of the single-level blocks while weakly erasing non-volatilestorage elements associated with odd word lines in the first block;programming data in non-volatile storage elements associated with theeven word lines of the plurality of word lines, non-volatile storageelements associated with the odd word lines are not programmed; andtransferring the data from the non-volatile storage elements associatedwith the even word lines to one or more of the multi-level cell blocks,the transferring occurs while non-volatile storage elements associatedwith the odd word lines remain weakly erased.
 34. The method of claim33, further comprising: performing a normal erase of non-volatilestorage elements associated with the odd word lines in the first blockwhile weakly erasing non-volatile storage elements associated with theeven word lines in the first block; programming data in non-volatilestorage elements associated with the odd word lines of the plurality ofword lines, non-volatile storage elements associated with the even wordlines are not programmed; and transferring the data from thenon-volatile storage elements associated with the odd word lines to oneor more of the multi-level cell blocks, the transferring occurs whilenon-volatile storage elements associated with the even word lines remainweakly erased.